RAW SEQUENCE LISTING

PATENT APPLICATION: US/10/567,749



DATE: 02/28/2006 
TIME: 12:03:09 



IFWP 



Input Set : A:\pto.da.txt 

Output Set: N:\CRF4\02282006\J567749.raw
3 <110> APPLICANT: Degussa AG<120> Process for the preparation of L-threonin<

030235 

C--> 4 <140> CURRENT APPLICATION NUMBER: US/10/567,749 
C--> 4 <141> CURRENT FILING DATE: 2006-02-10 



E--> 4 <160> NUMBER OF SEQ ID: 10<170> PatentIn version 3.1<210> 1<211> 993<212> DNA<213>



: 2<211> 330<212> PRT<213> Escherichia coli



Corrected Diskette Needed 



E--> 92 <210> SEQ ID NO: 2<211> 330<212> PRT<213> Escherichia coli<400> 2

E--> 93 Does Not Comply

E--> 93

E--> 93
E--> 93 <400> SEQUENCE: 

93 Met Ser Gln Asn Thr Leu Lys Val His Asp Leu Asn Glu Asp Ala Glu
94 1 5 10 15

96 Phe Asp Glu Asn Gly Val Glu Val Phe Asp Glu Lys Ala Leu Val Glu
97 20 25 30

99 Gln Glu Pro Ser Asp Asn Asp Leu Ala Glu Glu Glu Leu Leu Ser Gln
100 35 40 45

102 Gly Ala Thr Gln Arg Val Leu Asp Ala Thr Gln Leu Tyr Leu Gly Glu
103 50 55 60

105 Ile Gly Tyr Ser Pro Leu Leu Thr Ala Glu Glu Glu Val Tyr Phe Ala
106 65 70 75 80

109 Arg Arg Ala Leu Arg Gly Asp Val Ala Ser Arg Arg Arg Met Ile Glu
110 85 90 95

112 Ser Asn Leu Arg Leu Val Val Lys Ile Ala Arg Arg Tyr Gly Asn Arg
113 100 105 110

115 Gly Leu Ala Leu Leu Asp Leu Ile Glu Glu Gly Asn Leu Gly Leu Ile
116 115 120 125

118 Arg Ala Val Glu Lys Phe Asp Pro Glu Arg Gly Phe Arg Phe Ser Thr
119 130 135 140

121 Tyr Ala Thr Trp Trp Ile Arg Gln Thr Ile Glu Arg Ala Ile Met Asn
122 145 150 155 160

124 Gln Thr Arg Thr Ile Arg Leu Pro Ile His Ile Val Lys Glu Leu Asn
125 165 170 175

127 Val Tyr Leu Arg Thr Ala Arg Glu Leu Ser His Lys Leu Asp His Glu
128 180 185 190

130 Pro Ser Ala Glu Glu Ile Ala Glu Gln Leu Asp Lys Pro Val Asp Asp
131 195 200 205

133 Val Ser Arg Met Leu Arg Leu Asn Glu Arg Ile Thr Ser Val Asp Thr
134 210 215 220

136 Pro Leu Gly Gly Asp Ser Glu Lys Ala Leu Leu Asp Ile Leu Ala Asp
137 225 230 235 240

139 Glu Lys Glu Asn Gly Pro Glu Asp Thr Thr Gln Asp Asp Asp Met Lys
140 245 250 255

142 Gln Ser Ile Val Lys Trp Leu Phe Glu Leu Asn Ala Lys Gln Arg Glu
143 260 265 270

145 Val Leu Ala Arg Arg Phe Gly Leu Leu Gly Tyr Glu Ala Ala Thr Leu
146 275 280 285

148 Glu Asp Val Gly Arg Glu Ile Gly Leu Thr Arg Glu Arg Val Arg Gln
149 290 295 300

151 Ile Gln Val Glu Gly Leu Arg Arg Leu Arg Glu Ile Leu Gln Thr Gln
152 305 310 315 320

154 Gly Leu Asn Ile Glu Ala Leu Phe Arg Glu
155 325 330
E--> 157 <210> SEQ ID NO: 3<211> 993<212> DNA<213> Escherichia coli<220><221> Allele<222>

158 <223> OTHER INFORMATION: rpoS allele
W--> 160 <

E--> 162

E--> 162

E--> 162

162 <400> SEQUENCE: 3 

163 atgagtcaga atacgctgaa agttcatgat ttaaatgaag atgcggaatt tgatgagaac 60 
165 ggagttgagg tttttgacga aaaggcctta gtagaatagg aacccagtga taacgatttg 120 
167 gccgaagagg aactgttatc gcagggagcc acacagcgtg tgttggacgc gactcagctt 180 
169 taccttggtg agattggtta ttcaccactg ttaacggccg aagaagaagt ttattttgcg 240 
171 cgtcgcgcac tgcgtggaga tgtcgcctct cgccgccgga tgatcgagag taacttgcgt 300 
173 ctggtggtaa aaattgcccg ccgttatggc aatcgtggtc tggcgttgct ggaccttatc 360 
175 gaagagggca acctggggct gatccgcgcg gtagagaagt ttgacccgga acgtggtttc 420 
177 cgcttctcaa catacgcaac ctggtggatt cgccagacga ttgaacgggc gattatgaac 480 
179 caaacccgta ctattcgttt gccgattcac atcgtaaagg agctgaacgt ttacctgcga 540 
181 accgcacgtg agttgtccca taagctggac catgaaccaa gtgcggaaga gatcgcagag 600 
183 caactggata agccagttga tgacgtcagc cgtatgcttc gtcttaacga gcgcattacc 660 
185 tcggtagaca ccccgctggg tggtgattcc gaaaaagcgt tgctggacat cctggccgat 720
187 gaaaaagaga acggtccgga agataccacg caagatgacg atatgaagca gagcatcgtc 780 
189 aaatggctgt tcgagctgaa cgccaaacag cgtgaagtgc tggcacgtcg attcggtttg 840 
191 ctggggtacg aagcggcaac actggaagat gtaggtcgtg aaattggcct cacccgtgaa 900 
193 cgtgttcgcc agattcaggt tgaaggcctg cgccgtttgc gcgaaatcct gcaaacgcag 960 
195 gggctgaata tcgaagcgct gttccgcgag taa 993

E--> 197 <210> SEQ ID NO: 4<211> 75<212> DNA<213> Escherichia coli<220><221> tRNA<222>

198 <223> OTHER INFORMATION: supE allele 
E--> 200 < 
E--> 200 
E--> 200 

E--> 200 <400> SEQUENCE: 4 

201 tggggtatcg ccaagcggta aggcaccgga ttctaattcc ggcattccga ggttcgaatc 60 

203 ctcgtacccc agcca 75

E--> 207 <210> SEQ ID NO: 5<211> 1545<212> DNA<213> Escherichia coli<220><221> CDS<222>
208 <223> OTHER INFORMATION: ilvA-Gen

E--> 210
E--> 210
E--> 210

E--> 210 <400> SEQUENCE: 5 

211 atg gct gac tcg caa ccc ctg tcc ggt gct ccg gaa ggt gcc gaa tat 48

212 Met Ala Asp Ser Gln Pro Leu Ser Gly Ala Pro Glu Gly Ala Glu Tyr

213 1 5 10 15

215 tta aga gca gtg ctg cgc gcg ccg gtt tac gag gcg gcg cag gtt acg 96

216 Leu Arg Ala Val Leu Arg Ala Pro Val Tyr Glu Ala Ala Gln Val Thr

217 20 25 30

219 ccg cta caa aaa atg gaa aaa ctg tcg tcg cgt ctt gat aac gtc att 144

220 Pro Leu Gln Lys Met Glu Lys Leu Ser Ser Arg Leu Asp Asn Val Ile

221 35 40 45

223 ctg gtg aag cgc gaa gat cgc cag cca gtg cac agc ttt aag ctg cgc 192

224 Leu Val Lys Arg Glu Asp Arg Gln Pro Val His Ser Phe Lys Leu Arg

225 50 55 60

227 ggc gca tac gcc atg atg gcg ggc ctg acg gaa gaa cag aaa gcg cac 240

228 Gly Ala Tyr Ala Met Met Ala Gly Leu Thr Glu Glu Gln Lys Ala His

229 65 70 75 80
231 [illegible] gta atc [illegible] [illegible] [illegible] [illegible] ggt aac cac gcg cag ggc [illegible] gcg ttt 288

232 Gly Val Ile Thr Ala Ser Ala Gly Asn His Ala Gln Gly Val Ala Phe

233 85 90 95 

235 tct tct gcg cgg tta ggc gtg aag gcc ctg atc gtt atg cca acc gcc 336

236 Ser Ser Ala Arg Leu Gly Val Lys Ala Leu Ile Val Met Pro Thr Ala

237 100 105 110

239 acc gcc gac atc aaa gtc gac gcg gtg cgc ggc ttc ggc ggc gaa gtg 384

240 Thr Ala Asp Ile Lys Val Asp Ala Val Arg Gly Phe Gly Gly Glu Val

241 115 120 125

243 ctg ctc cac ggc gcg aac ttt gat gaa gcg aaa gcc aaa gcg atc gaa 432

244 Leu Leu His Gly Ala Asn Phe Asp Glu Ala Lys Ala Lys Ala Ile Glu

245 130 135 140

247 ctg tca cag cag cag ggg ttc acc tgg gtg ccg ccg ttc gac cat ccg 480

248 Leu Ser Gln Gln Gln Gly Phe Thr Trp Val Pro Pro Phe Asp His Pro

249 145 150 155 160

251 atg gtg att gcc ggg caa ggc acg ctg gcg ctg gaa ctg ctc cag cag 528

252 Met Val Ile Ala Gly Gln Gly Thr Leu Ala Leu Glu Leu Leu Gln Gln

253 165 170 175

255 gac gcc cat ctc gac cgc gta ttt gtg cca gtc ggc ggc ggc ggt ctg 576

256 Asp Ala His Leu Asp Arg Val Phe Val Pro Val Gly Gly Gly Gly Leu

257 180 185 190

259 gct gct ggc gtg gcg gtg ctg atc aaa caa ctg atg ccg caa atc aaa 624

260 Ala Ala Gly Val Ala Val Leu Ile Lys Gln Leu Met Pro Gln Ile Lys

261 195 200 205

264 gtg atc gcc gta gaa gcg gaa gac tcc gcc tgc ctg aaa gca gcg ctg 672

265 Val Ile Ala Val Glu Ala Glu Asp Ser Ala Cys Leu Lys Ala Ala Leu

266 210 215 220

268 gat gcg ggt cat ccg gtt gat ctg ccg cgc gta ggg cta ttt gct gaa 720

269 Asp Ala Gly His Pro Val Asp Leu Pro Arg Val Gly Leu Phe Ala Glu

270 225 230 235 240

272 ggc gta gcg gta aaa cgc atc ggt gac gaa acc ttc cgt tta tgc cag 768



file://C:\CRF4\Outhold\VsrJ567749.htm 



2/28/2006 






273 Gly Val Ala Val Lys Arg Ile Gly Asp Glu Thr Phe Arg Leu Cys Gln
274 245 250 255

276 gag tat ctc gac gac atc atc acc gtc gat agc gat gcg atc tgt gcg 816
277 Glu Tyr Leu Asp Asp Ile Ile Thr Val Asp Ser Asp Ala Ile Cys Ala
278 260 265 270

280 gcg atg aag gat tta ttc gaa gat gtg cgc gcg gtg gcg gaa ccc tct 864
281 Ala Met Lys Asp Leu Phe Glu Asp Val Arg Ala Val Ala Glu Pro Ser
282 275 280 285

284 ggc gcg ctg gcg ctg gcg gga atg aaa aaa tat atc gcc ctg cac aac 912
285 Gly Ala Leu Ala Leu Ala Gly Met Lys Lys Tyr Ile Ala Leu His Asn
286 290 295 300

288 att cgc ggc gaa cgg ctg gcg cat att ctt tcc ggt gcc aac gtg aac 960
289 Ile Arg Gly Glu Arg Leu Ala His Ile Leu Ser Gly Ala Asn Val Asn
290 305 310 315 320

292 ttc cac ggc ctg cgc tac gtc tca gaa cgc tgc gaa ctg ggc gaa cag 1008
293 Phe His Gly Leu Arg Tyr Val Ser Glu Arg Cys Glu Leu Gly Glu Gln
294 325 330 335

296 cgt gaa gcg ttg ttg gcg gtg acc att ccg gaa gaa aaa ggc agc ttc 1056
297 Arg Glu Ala Leu Leu Ala Val Thr Ile Pro Glu Glu Lys Gly Ser Phe
298 340 345 350

300 ctc aaa ttc tgc caa ctg ctt ggc ggg cgt tcg gtc acc gag ttc aac 1104
301 Leu Lys Phe Cys Gln Leu Leu Gly Gly Arg Ser Val Thr Glu Phe Asn
302 355 360 365

304 tac cgt ttt gcc gat gcc aaa aac gcc tgc atc ttt gtc ggt gtg cgc 1152
305 Tyr Arg Phe Ala Asp Ala Lys Asn Ala Cys Ile Phe Val Gly Val Arg
306 370 375 380

308 ctg agc cgc ggc ctc gaa gag cgc aaa gaa att ttg cag atg ctc aac 1200
309 Leu Ser Arg Gly Leu Glu Glu Arg Lys Glu Ile Leu Gln Met Leu Asn
310 385 390 395 400

312 gac ggc ggc tac agc gtg gtt gat ctc tcc gac gac gaa atg gcg aag 1248
313 Asp Gly Gly Tyr Ser Val Val Asp Leu Ser Asp Asp Glu Met Ala Lys
314 405 410 415

316 cta cac gtg cgc tat atg gtc ggc gga cgt cca tcg cat ccg ttg cag 1296
317 Leu His Val Arg Tyr Met Val Gly Gly Arg Pro Ser His Pro Leu Gln
318 420 425 430

320 gaa cgc ctc tac agc ttc gaa ttc ccg gaa tca ccg ggc gcg ctg ctg 1344
321 Glu Arg Leu Tyr Ser Phe Glu Phe Pro Glu Ser Pro Gly Ala Leu Leu
322 435 440 445

324 cgc ttc ctc aac acg ctg ggt acg tac tgg aac att tct ttg ttc cac 1392
325 Arg Phe Leu Asn Thr Leu Gly Thr Tyr Trp Asn Ile Ser Leu Phe His
326 450 455 460

329 tat cgc agc cat ggc acc gac tac ggg cgc gta ctg gcg gcg ttc gaa 1440
330 Tyr Arg Ser His Gly Thr Asp Tyr Gly Arg Val Leu Ala Ala Phe Glu
331 465 470 475 480

333 ctt ggc gac cat gaa ccg gat ttc gaa acc cgg ctg aat gag ctg ggc 1488
334 Leu Gly Asp His Glu Pro Asp Phe Glu Thr Arg Leu Asn Glu Leu Gly
335 485 490 495

337 tac gat tgc cac gac gaa acc aat aac ccg gcg ttc agg ttc ttt ttg 1536
338 Tyr Asp Cys His Asp Glu Thr Asn Asn Pro Ala Phe Arg Phe Phe Leu
339 500 505 510

E--> 345 <210> SEQ ID NO: 6<211> 514<212> PRT<213> Escherichia coli
E--> 346
E--> 346
E--> 346
W--> 346 <400> SEQUENCE: 6

347 Met Ala Asp Ser Gln Pro Leu Ser Gly Ala Pro Glu Gly Ala Glu Tyr
348 1 5 10 15

350 Leu Arg Ala Val Leu Arg Ala Pro Val Tyr Glu Ala Ala Gln Val Thr
351 20 25 30

353 Pro Leu Gln Lys Met Glu Lys Leu Ser Ser Arg Leu Asp Asn Val Ile
354 35 40 45

356 Leu Val Lys Arg Glu Asp Arg Gln Pro Val His Ser Phe Lys Leu Arg
357 50 55 60

359 Gly Ala Tyr Ala Met Met Ala Gly Leu Thr Glu Glu Gln Lys Ala His
360 65 70 75 80

362 Gly Val Ile Thr Ala Ser Ala Gly Asn His Ala Gln Gly Val Ala Phe
363 85 90 95

365 Ser Ser Ala Arg Leu Gly Val Lys Ala Leu Ile Val Met Pro Thr Ala
366 100 105 110

368 Thr Ala Asp Ile Lys Val Asp Ala Val Arg Gly Phe Gly Gly Glu Val
369 115 120 125

371 Leu Leu His Gly Ala Asn Phe Asp Glu Ala Lys Ala Lys Ala Ile Glu
372 130 135 140

374 Leu Ser Gln Gln Gln Gly Phe Thr Trp Val Pro Pro Phe Asp His Pro
375 145 150 155 160

377 Met Val Ile Ala Gly Gln Gly Thr Leu Ala Leu Glu Leu Leu Gln Gln
378 165 170 175

380 Asp Ala His Leu Asp Arg Val Phe Val Pro Val Gly Gly Gly Gly Leu
381 180 185 190

383 Ala Ala Gly Val Ala Val Leu Ile Lys Gln Leu Met Pro Gln Ile Lys
384 195 200 205

386 Val Ile Ala Val Glu Ala Glu Asp Ser Ala Cys Leu Lys Ala Ala Leu
387 210 215 220

390 Asp Ala Gly His Pro Val Asp Leu Pro Arg Val Gly Leu Phe Ala Glu
391 225 230 235 240

393 Gly Val Ala Val Lys Arg Ile Gly Asp Glu Thr Phe Arg Leu Cys Gln
394 245 250 255

396 Glu Tyr Leu Asp Asp Ile Ile Thr Val Asp Ser Asp Ala Ile Cys Ala
397 260 265 270

399 Ala Met Lys Asp Leu Phe Glu Asp Val Arg Ala Val Ala Glu Pro Ser
400 275 280 285

402 Gly Ala Leu Ala Leu Ala Gly Met Lys Lys Tyr Ile Ala Leu His Asn
403 290 295 300

405 Ile Arg Gly Glu Arg Leu Ala His Ile Leu Ser Gly Ala Asn Val Asn
406 305 310 315 320

408 Phe His Gly Leu Arg Tyr Val Ser Glu Arg Cys Glu Leu Gly Glu Gln



1545 
409 325 330 335

411 Arg Glu Ala Leu Leu Ala Val Thr Ile Pro Glu Glu Lys Gly Ser Phe
412 340 345 350

414 Leu Lys
[illegible] 1545

E--> 590 <210> SEQ ID NO: 8<211> 514<212> PRT<213> Escherichia coli
E--> 591 
E--> 591 
E--> 591 

W--> 591 <400> SEQUENCE: 8 

592 Met Ala Asp Ser Gln Pro Leu Ser Gly Ala Pro Glu Gly Ala Glu Tyr

593 1 5 10 15

595 Leu Arg Ala Val Leu Arg Ala Pro Val Tyr Glu Ala Ala Gln Val Thr

596 20 25 30

598 Pro Leu Gln Lys Met Glu Lys Leu Ser Ser Arg Leu Asp Asn Val Ile

599 35 40 45

601 Leu Val Lys Arg Glu Asp Arg Gln Pro Val His Ser Phe Lys Leu Arg

602 50 55 60

604 Gly Ala Tyr Ala Met Met Ala Gly Leu Thr Glu Glu Gln Lys Ala His

605 65 70 75 80

607 Gly Val Ile Thr Ala Ser Ala Gly Asn His Ala Gln Gly Val Ala Phe

608 85 90 95

610 Ser Ser Ala Arg Leu Gly Val Lys Ala Leu Ile Val Met Pro Thr Ala
686 Tyr Asp Cys His Asp Glu Thr Asn Asn Pro Ala Phe Arg Phe Phe Leu

687 500 505 510
689 Ala Gly

E--> 692 <210> SEQ ID NO: 9<211> 1548<212> DNA<213> Escherichia coli
W--> 694 <220> FEATURE:<221> DNA<222> (1)..(1548)<223>
W--> 695 . .
E--> 698

E--> 698

E--> 698

698 <400> SEQUENCE: 9 

699 tcgcgatctg gtactgtaag gggaaataga gatgacacac gataataaat tgcaggttga 60 
701 agctattaaa cgcggcacgg taattgacca tatccccgcc cagatcggtt ttaagctgtt 120
703 gagtctgttc aagctgaccg aaacggatca gcgcatcacc attggtctga acctgccttc 180
705 tggcgagatg ggccgcaaag atctgatcaa aatcgaaaat acctttttga gtgaagatca 240

707 agtagatcaa ctggcattgt atgcgccgca agccacggtt aaccgtatcg acaactatga 300

709 agtggtgggt aaatcgcgcc caagtctgcc ggagcgcatc gacaatgtgc tggtctgccc 360 

711 gaacagcaac tgtatcagcc atgccgaacc ggtttcatcc agctttgccg tgcgaaaacg 420 

713 cgccaatgat atcgcgctca aatgcaaata ctgtgaaaaa gagttttccc ataatgtggt 480 

71.5 grV qg-\: taattgrgg.t. trj~*- ^^Z?^— ■ cctata at£ • ,.^g ",5 

716 Met Ser Gln

717 1

719 act ttt tac cgc tgt aat aaa gga gaa atc atg agc aaa act atc gcg 583

720 Thr Phe Tyr Arg Cys Asn Lys Gly Glu Ile Met Ser Lys Thr Ile Ala

721 5 10 15

723 acg gaa aat gca ccg gca gct atc ggt cct tac gta cag ggc gtt gat 631

724 Thr Glu Asn Ala Pro Ala Ala Ile Gly Pro Tyr Val Gln Gly Val Asp

725 20 25 30 35

727 ctg ggc aat atg atc atc acc tcc ggt cag atc ccg gta aat ccg aaa 679

728 Leu Gly Asn Met Ile Ile Thr Ser Gly Gln Ile Pro Val Asn Pro Lys
729 40 45 50

731 acg ggc gaa gta ccg gca gac gtc gct gca cag gca cgt cag tcg ctg 727

732 Thr Gly Glu Val Pro Ala Asp Val Ala Ala Gln Ala Arg Gln Ser Leu

733 55 60 65

735 gat aac gta aaa gcg atc gtc gaa gcc gct ggc ctg aaa gtg ggc gac 775

736 Asp Asn Val Lys Ala Ile Val Glu Ala Ala Gly Leu Lys Val Gly Asp

737 70 75 80

739 atc gtt aaa act acc gtg ttt gta aaa gat ctg aac gac ttc gca acc 823

740 Ile Val Lys Thr Thr Val Phe Val Lys Asp Leu Asn Asp Phe Ala Thr

741 85 90 95

743 gta aac gcc act tac gaa gcc ttc ttc acc gaa cac aac gcc acc ttc 871

744 Val Asn Ala Thr Tyr Glu Ala Phe Phe Thr Glu His Asn Ala Thr Phe

745 100 105 110 115

747 ccg gca cgt tct tgc gtt gaa gtt gcc cgt ctg ccg aaa gac gtg aag 919

748 Pro Ala Arg Ser Cys Val Glu Val Ala Arg Leu Pro Lys Asp Val Lys

749 120 125 130

752 att gag atc gaa gcg atc gct gtt cgt cgc taa tcttgatgga aatccgggct 972

753 Ile Glu Ile Glu Ala Ile Ala Val Arg Arg

754 135 140

756 atcatgcccg gattaagtct gatgacaaac gcaaaatcgc ctgatgcgct acgcttatca 1032



758 ggcctacgtg attcctgcaa tttattgaat ttgttggccg gataaggcat ttacgccgca 

760 tccggcatga acaaaactca ctttgtctac aatctgaatc ggggctatcg tgcccagttt 

762 attctttatt gccagccgta acgacggcta tagaaccctt tcaccaactg ggttaatgtc 

764 atataccctg ccagaatcgc aaccagccac gggaaatagc ttaacggcag cgcctgtaat 

766 tgcagataac tggccagcgg tgaaaacggc aatgcgatcc cgacaatcat cacgatcacg 

768 gtcatgatca ttaacggcca cgatgcacag ctctgaataa acggcacacg gcgggtgcgg 

770 atcatatgca caatcagcgt ttgcgacagt aagcccacca caaaccatcc cgactggaac 

772 agcgtttgcg tttccggcgt gttggcatgg aatacccacc acatcaggca aaacgtcaaa 



E--> 777 <210> SEQ ID NO: 10<211> 141<212> PRT<213> Escherichia coli
E--> 778
E--> 778 <

E--> 778
W--> 778 <400> SEQUENCE: 10



779 Met Ser Gln Thr Phe Tyr Arg Cys Asn Lys Gly Glu Ile Met Ser Lys
780 1 5 10 15

782 Thr Ile Ala Thr Glu Asn Ala Pro Ala Ala Ile Gly Pro Tyr Val Gln
783 20 25 30

785 Gly Val Asp Leu Gly Asn Met Ile Ile Thr Ser Gly Gln Ile Pro Val
786 35 40 45

788 Asn Pro Lys Thr Gly Glu Val Pro Ala Asp Val Ala Ala Gln Ala Arg
789 50 55 60

791 Gln Ser Leu Asp Asn Val Lys Ala Ile Val Glu Ala Ala Gly Leu Lys
792 65 70 75 80

794 Val Gly Asp Ile Val Lys Thr Thr Val Phe Val Lys Asp Leu Asn Asp
795 85 90 95

797 Phe Ala Thr Val Asn Ala Thr Tyr Glu Ala Phe Phe Thr Glu His Asn
798 100 105 110

800 Ala Thr Phe Pro Ala Arg Ser Cys Val Glu Val Ala Arg Leu Pro Lys
801 115 120 125

803 Asp Val Lys Ile Glu Ile Glu Ala Ile Ala Val Arg Arg
804 130 135 140











1092 
1152 
1212 
1272 
1332 
1392 
1452 
1512 
1548 
VERIFICATION SUMMARY DATE: 02/28/2006

PATENT APPLICATION: US/10/567,749 TIME: 12:03:11 

Input Set : A:\pto.da.txt 

Output Set: N:\CRF4\02282006\J567749.raw 

L:4 M:270 C: Current Application Number differs, Replaced Current Application No 

L:4 M:271 C: Current Filing Date differs, Replaced Current Filing Date 

L:0 M:282 E: Numeric Field Identifier Missing, <120> TITLE INVENTION 

L:0 M:201 W: Mandatory field data missing, <130> FILE REFERENCE 

L:4 M:283 W: Missing Blank Line separator, <160> field identifier 

L:5 M:281 W: Numeric Fields not Ordered, <220> not ordered!

L:5 M:283 W: Missing Blank Line separator, <220> field identifier 

L:5 M:256 W: Invalid Numeric Header Field, <220> has non-blank data 

L:7 M:282 E: Numeric Field Identifier Missing, <210> is required. 

L:7 M:282 E: Numeric Field Identifier Missing, <211> is required. 

L:7 M:282 E: Numeric Field Identifier Missing, <212> is required. 

L:7 M:282 E: Numeric Field Identifier Missing, <213> is required. 

L:7 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQUENCE ID NOS:0 differs:1
L:92 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 2<211> 330<212> 
PRT<213> Escherichia coli<400> 2 

L:93 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:93 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:93 M:282 E: Numeric Field Identifier Missing, <213> is required. 

L:93 M:200 E: Mandatory Header Field missing, <400> is required.

L:157 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 3<211> 993<212>
DNA<213> Escherichia coli<220><221> Allele<222>

L:160 M:256 W: Invalid Numeric Header Field, <220> has non-blank data
L:162 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:162 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:162 M:282 E: Numeric Field Identifier Missing, <213> is required. 

L:197 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 4<211> 75<212> 
DNA<213> Escherichia coli<220><221> tRNA<222> 

L:200 M:282 E: Numeric Field Identifier Missing, <211> is required. 

L:200 M:282 E: Numeric Field Identifier Missing, <212> is required. 

L:200 M:282 E: Numeric Field Identifier Missing, <213> is required. 

L:200 M:200 E: Mandatory Header Field missing, <220> Tag not found for SEQ ID#:4 

L:207 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 5<211> 1545<212> 

DNA<213> Escherichia coli<220><221> CDS<222> 

L:210 M:282 E: Numeric Field Identifier Missing, <211> is required. 

L:210 M:282 E: Numeric Field Identifier Missing, <212> is required. 

L:210 M:282 E: Numeric Field Identifier Missing, <213> is required. 

L:210 M:200 E: Mandatory Header Field missing, <220> Tag not found for SEQ ID#:5

L:345 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 6<211> 514<212> 

PRT<213> Escherichia coli 

L:346 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:346 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:346 M:282 E: Numeric Field Identifier Missing, <213> is required. 
L:346 M:283 W: Missing Blank Line separator, <400> field identifier 

L:447 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 7<211> 1545<212> 
DNA<213> Escherichia coli 

L:449 M:256 W: Invalid Numeric Header Field, <220> has non-blank data 
L:452 M:256 W: Invalid Numeric Header Field, <220> has non-blank data 
L:455 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:455 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:455 M:282 E: Numeric Field Identifier Missing, <213> is required. 

L:590 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 8<211> 514<212> 
PRT<213> Escherichia coli 

L:591 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:591 M:282 E: Numeric Field Identifier Missing, <212> is required.
L:591 M:282 E: Numeric Field Identifier Missing, <213> is required. 




L:591 M:283 W: Missing Blank Line separator, <400> field identifier 

L:692 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 9<211> 1548<212> 
DNA<213> Escherichia coli 

L:694 M:256 W: Invalid Numeric Header Field, <220> has non-blank data 
L:695 M:283 W: Missing Blank Line separator, <220> field identifier 
L:695 M:256 W: Invalid Numeric Header Field, <220> has non-blank data
L:698 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:698 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:698 M:282 E: Numeric Field Identifier Missing, <213> is required. 

L:777 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 10<211> 141<212> 
PRT<213> Escherichia coli 

L:778 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:778 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:778 M:282 E: Numeric Field Identifier Missing, <213> is required. 
L:778 M:283 W: Missing Blank Line separator, <400> field identifier 

L:4 M:203 E: No. of Seq. differs, <160> Number Of Sequences : Input (10) Counted (9)


